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Figurel: Flow diagram of GeneDecipher 



ATGCCTAAGTACCGTTCCGCCACCACCACT. 

Translate in all 6 frames 



..C ACCGG A ATG ACCG ACGCCG ATTTCGGTA A . 

Nucleotide string 



3 forward frames 



.mPKYRSATTT HRNDRRRFRz.. 

...CLSTVPPPPL TGmTDADFG.. 

. . . AzVPFRHHH PEzPTPISV. . . 



Hypothetical 
proteins in 6 frames 



Search each overlapping heptapeptide in the 
library and report occurrence profile. Peptides 
starting with 'm' is replaced by *s' and those 
containing 'z* are replaced by 



...sill 1 1 1447. ..000*******. 
000000000 sOOOOOOs... 



6 Integer coded strings 



♦0000.. 



******* 



Split the integer strings into fragments with start ('s') 
coded by ATG, GTG, TTG and stop codon(***) coded 
by TTA, TAG and TGA. Seven consecutive **' in the 
integer coded sequence denotes end of a gene. 



The integer represents number 
of organisms in which the 
heptapeptide is present in the 
library. More than 9 occurrence 
value is treated as 9. 



Peptide Library format 



All possible coding regions (ORFs) 



ANN trained on E. coli-K12 
genome 



Predicted protein coding regions 
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Figure 2: Architecture of Artificial Neural Network 




Input layer 

5 Inputs N = 30 



Hidden layer 
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Fig No. 3 Sensitivity and Specificity of the Method of Invention 



